Dataset statistics
| Number of variables | 18 |
|---|---|
| Number of observations | 403776 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 55.5 MiB |
| Average record size in memory | 144.0 B |
Variable types
| Numeric | 15 |
|---|---|
| Categorical | 3 |
REF_NO is highly correlated with year | High correlation |
year is highly correlated with REF_NO | High correlation |
PM2.5 is highly correlated with PM10 and 2 other fields | High correlation |
PM10 is highly correlated with PM2.5 and 2 other fields | High correlation |
SO2 is highly correlated with CO | High correlation |
NO2 is highly correlated with PM2.5 and 2 other fields | High correlation |
CO is highly correlated with PM2.5 and 3 other fields | High correlation |
O3 is highly correlated with TEMP | High correlation |
TEMP is highly correlated with O3 and 2 other fields | High correlation |
PRES is highly correlated with TEMP and 1 other fields | High correlation |
DEWP is highly correlated with TEMP and 1 other fields | High correlation |
REF_NO is highly correlated with year | High correlation |
year is highly correlated with REF_NO | High correlation |
PM2.5 is highly correlated with PM10 and 2 other fields | High correlation |
PM10 is highly correlated with PM2.5 and 2 other fields | High correlation |
SO2 is highly correlated with NO2 and 1 other fields | High correlation |
NO2 is highly correlated with PM2.5 and 4 other fields | High correlation |
CO is highly correlated with PM2.5 and 3 other fields | High correlation |
O3 is highly correlated with NO2 and 1 other fields | High correlation |
TEMP is highly correlated with O3 and 2 other fields | High correlation |
PRES is highly correlated with TEMP and 1 other fields | High correlation |
DEWP is highly correlated with TEMP and 1 other fields | High correlation |
REF_NO is highly correlated with year | High correlation |
year is highly correlated with REF_NO | High correlation |
PM2.5 is highly correlated with PM10 and 1 other fields | High correlation |
PM10 is highly correlated with PM2.5 and 1 other fields | High correlation |
NO2 is highly correlated with CO | High correlation |
CO is highly correlated with PM2.5 and 2 other fields | High correlation |
TEMP is highly correlated with PRES and 1 other fields | High correlation |
PRES is highly correlated with TEMP and 1 other fields | High correlation |
DEWP is highly correlated with TEMP and 1 other fields | High correlation |
year is highly correlated with REF_NO | High correlation |
PM10 is highly correlated with CO and 2 other fields | High correlation |
month is highly correlated with PRES and 3 other fields | High correlation |
PRES is highly correlated with month and 3 other fields | High correlation |
REF_NO is highly correlated with year and 4 other fields | High correlation |
TEMP is highly correlated with month and 3 other fields | High correlation |
CO is highly correlated with PM10 and 2 other fields | High correlation |
PM2.5 is highly correlated with PM10 and 2 other fields | High correlation |
DEWP is highly correlated with month and 3 other fields | High correlation |
NO2 is highly correlated with PM10 and 2 other fields | High correlation |
RAIN is highly skewed (γ1 = 29.4497644) | Skewed |
REF_NO is uniformly distributed | Uniform |
station is uniformly distributed | Uniform |
hour has 16824 (4.2%) zeros | Zeros |
RAIN has 387119 (95.9%) zeros | Zeros |
WSPM has 10891 (2.7%) zeros | Zeros |
Reproduction
| Analysis started | 2021-08-19 12:22:42.888990 |
|---|---|
| Analysis finished | 2021-08-19 12:30:41.848384 |
| Duration | 7 minutes and 58.96 seconds |
| Software version | pandas-profiling v3.0.0 |
| Download configuration | config.json |
| Distinct | 33648 |
|---|---|
| Distinct (%) | 8.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16824.5 |
| Minimum | 1 |
|---|---|
| Maximum | 33648 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1683 |
| Q1 | 8412.75 |
| median | 16824.5 |
| Q3 | 25236.25 |
| 95-th percentile | 31966 |
| Maximum | 33648 |
| Range | 33647 |
| Interquartile range (IQR) | 16823.5 |
Descriptive statistics
| Standard deviation | 9713.352953 |
|---|---|
| Coefficient of variation (CV) | 0.5773338258 |
| Kurtosis | -1.200000002 |
| Mean | 16824.5 |
| Median Absolute Deviation (MAD) | 8412 |
| Skewness | 0 |
| Sum | 6793329312 |
| Variance | 94349225.58 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2047 | 12 | < 0.1% |
| 12113 | 12 | < 0.1% |
| 16211 | 12 | < 0.1% |
| 1876 | 12 | < 0.1% |
| 3925 | 12 | < 0.1% |
| 5974 | 12 | < 0.1% |
| 8023 | 12 | < 0.1% |
| 26456 | 12 | < 0.1% |
| 28505 | 12 | < 0.1% |
| 30554 | 12 | < 0.1% |
| Other values (33638) | 403656 |
| Value | Count | Frequency (%) |
| 1 | 12 | |
| 2 | 12 | |
| 3 | 12 | |
| 4 | 12 | |
| 5 | 12 | |
| 6 | 12 | |
| 7 | 12 | |
| 8 | 12 | |
| 9 | 12 | |
| 10 | 12 |
| Value | Count | Frequency (%) |
| 33648 | 12 | |
| 33647 | 12 | |
| 33646 | 12 | |
| 33645 | 12 | |
| 33644 | 12 | |
| 33643 | 12 | |
| 33642 | 12 | |
| 33641 | 12 | |
| 33640 | 12 | |
| 33639 | 12 |
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.1 MiB |
| 2016 | |
|---|---|
| 2015 | |
| 2014 | |
| 2013 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Characters and Unicode
| Total characters | 1615104 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2013 |
|---|---|
| 2nd row | 2013 |
| 3rd row | 2013 |
| 4th row | 2013 |
| 5th row | 2013 |
Common Values
| Value | Count | Frequency (%) |
| 2016 | 105408 | |
| 2015 | 105120 | |
| 2014 | 105120 | |
| 2013 | 88128 |
Length
Pie chart
| Value | Count | Frequency (%) |
| 2016 | 105408 | |
| 2014 | 105120 | |
| 2015 | 105120 | |
| 2013 | 88128 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 403776 | |
| 0 | 403776 | |
| 1 | 403776 | |
| 6 | 105408 | 6.5% |
| 4 | 105120 | 6.5% |
| 5 | 105120 | 6.5% |
| 3 | 88128 | 5.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1615104 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 403776 | |
| 0 | 403776 | |
| 1 | 403776 | |
| 6 | 105408 | 6.5% |
| 4 | 105120 | 6.5% |
| 5 | 105120 | 6.5% |
| 3 | 88128 | 5.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1615104 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 403776 | |
| 0 | 403776 | |
| 1 | 403776 | |
| 6 | 105408 | 6.5% |
| 4 | 105120 | 6.5% |
| 5 | 105120 | 6.5% |
| 3 | 88128 | 5.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1615104 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 403776 | |
| 0 | 403776 | |
| 1 | 403776 | |
| 6 | 105408 | 6.5% |
| 4 | 105120 | 6.5% |
| 5 | 105120 | 6.5% |
| 3 | 88128 | 5.5% |
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6.735378031 |
| Minimum | 1 |
|---|---|
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 4 |
| median | 7 |
| Q3 | 10 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 3.356479072 |
|---|---|
| Coefficient of variation (CV) | 0.4983356623 |
| Kurtosis | -1.157296025 |
| Mean | 6.735378031 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -0.0532691034 |
| Sum | 2719584 |
| Variance | 11.26595176 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 12 | 35712 | |
| 10 | 35712 | |
| 8 | 35712 | |
| 7 | 35712 | |
| 5 | 35712 | |
| 3 | 35712 | |
| 11 | 34560 | |
| 9 | 34560 | |
| 6 | 34560 | |
| 4 | 34560 | |
| Other values (2) | 51264 |
| Value | Count | Frequency (%) |
| 1 | 26784 | |
| 2 | 24480 | |
| 3 | 35712 | |
| 4 | 34560 | |
| 5 | 35712 | |
| 6 | 34560 | |
| 7 | 35712 | |
| 8 | 35712 | |
| 9 | 34560 | |
| 10 | 35712 |
| Value | Count | Frequency (%) |
| 12 | 35712 | |
| 11 | 34560 | |
| 10 | 35712 | |
| 9 | 34560 | |
| 8 | 35712 | |
| 7 | 35712 | |
| 6 | 34560 | |
| 5 | 35712 | |
| 4 | 34560 | |
| 3 | 35712 |
day
Real number (ℝ≥0)
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.74821683 |
| Minimum | 1 |
|---|---|
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 8 |
| median | 16 |
| Q3 | 23 |
| 95-th percentile | 29 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 8.808891484 |
|---|---|
| Coefficient of variation (CV) | 0.5593580262 |
| Kurtosis | -1.195325155 |
| Mean | 15.74821683 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.005682826695 |
| Sum | 6358752 |
| Variance | 77.59656917 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 16 | 13248 | 3.3% |
| 15 | 13248 | 3.3% |
| 2 | 13248 | 3.3% |
| 3 | 13248 | 3.3% |
| 4 | 13248 | 3.3% |
| 5 | 13248 | 3.3% |
| 6 | 13248 | 3.3% |
| 7 | 13248 | 3.3% |
| 8 | 13248 | 3.3% |
| 9 | 13248 | 3.3% |
| Other values (21) | 271296 |
| Value | Count | Frequency (%) |
| 1 | 13248 | |
| 2 | 13248 | |
| 3 | 13248 | |
| 4 | 13248 | |
| 5 | 13248 | |
| 6 | 13248 | |
| 7 | 13248 | |
| 8 | 13248 | |
| 9 | 13248 | |
| 10 | 13248 |
| Value | Count | Frequency (%) |
| 31 | 7776 | |
| 30 | 12384 | |
| 29 | 12672 | |
| 28 | 13248 | |
| 27 | 13248 | |
| 26 | 13248 | |
| 25 | 13248 | |
| 24 | 13248 | |
| 23 | 13248 | |
| 22 | 13248 |
| Distinct | 24 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.5 |
| Minimum | 0 |
|---|---|
| Maximum | 23 |
| Zeros | 16824 |
| Zeros (%) | 4.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 5.75 |
| median | 11.5 |
| Q3 | 17.25 |
| 95-th percentile | 22 |
| Maximum | 23 |
| Range | 23 |
| Interquartile range (IQR) | 11.5 |
Descriptive statistics
| Standard deviation | 6.922195124 |
|---|---|
| Coefficient of variation (CV) | 0.6019300108 |
| Kurtosis | -1.204173965 |
| Mean | 11.5 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 0 |
| Sum | 4643424 |
| Variance | 47.91678534 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 23 | 16824 | 4.2% |
| 22 | 16824 | 4.2% |
| 1 | 16824 | 4.2% |
| 2 | 16824 | 4.2% |
| 3 | 16824 | 4.2% |
| 4 | 16824 | 4.2% |
| 5 | 16824 | 4.2% |
| 6 | 16824 | 4.2% |
| 7 | 16824 | 4.2% |
| 8 | 16824 | 4.2% |
| Other values (14) | 235536 |
| Value | Count | Frequency (%) |
| 0 | 16824 | |
| 1 | 16824 | |
| 2 | 16824 | |
| 3 | 16824 | |
| 4 | 16824 | |
| 5 | 16824 | |
| 6 | 16824 | |
| 7 | 16824 | |
| 8 | 16824 | |
| 9 | 16824 |
| Value | Count | Frequency (%) |
| 23 | 16824 | |
| 22 | 16824 | |
| 21 | 16824 | |
| 20 | 16824 | |
| 19 | 16824 | |
| 18 | 16824 | |
| 17 | 16824 | |
| 16 | 16824 | |
| 15 | 16824 | |
| 14 | 16824 |
| Distinct | 866 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 79.32702142 |
| Minimum | 2 |
|---|---|
| Maximum | 999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 21 |
| median | 57 |
| Q3 | 109 |
| 95-th percentile | 237 |
| Maximum | 999 |
| Range | 997 |
| Interquartile range (IQR) | 88 |
Descriptive statistics
| Standard deviation | 78.31352866 |
|---|---|
| Coefficient of variation (CV) | 0.9872238648 |
| Kurtosis | 5.907034808 |
| Mean | 79.32702142 |
| Median Absolute Deviation (MAD) | 40 |
| Skewness | 1.992182434 |
| Sum | 32030347.4 |
| Variance | 6133.008772 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 83 | 10228 | 2.5% |
| 3 | 8354 | 2.1% |
| 10 | 6609 | 1.6% |
| 11 | 6418 | 1.6% |
| 9 | 6374 | 1.6% |
| 12 | 6346 | 1.6% |
| 8 | 6333 | 1.6% |
| 13 | 5830 | 1.4% |
| 14 | 5765 | 1.4% |
| 7 | 5742 | 1.4% |
| Other values (856) | 335777 |
| Value | Count | Frequency (%) |
| 2 | 7 | < 0.1% |
| 3 | 8354 | |
| 4 | 3221 | 0.8% |
| 4.3 | 2 | < 0.1% |
| 4.4 | 1 | < 0.1% |
| 4.6 | 1 | < 0.1% |
| 5 | 3984 | |
| 6 | 5116 | |
| 7 | 5742 | |
| 7.2 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 999 | 1 | |
| 957 | 1 | |
| 941 | 1 | |
| 898 | 1 | |
| 882 | 1 | |
| 881 | 1 | |
| 857 | 1 | |
| 844 | 1 | |
| 826 | 1 | |
| 821 | 1 |
| Distinct | 1048 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 103.9992444 |
| Minimum | 2 |
|---|---|
| Maximum | 999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 11 |
| Q1 | 37 |
| median | 83 |
| Q3 | 144 |
| 95-th percentile | 275 |
| Maximum | 999 |
| Range | 997 |
| Interquartile range (IQR) | 107 |
Descriptive statistics
| Standard deviation | 89.47779529 |
|---|---|
| Coefficient of variation (CV) | 0.8603696673 |
| Kurtosis | 5.885451335 |
| Mean | 103.9992444 |
| Median Absolute Deviation (MAD) | 51 |
| Skewness | 1.839084868 |
| Sum | 41992398.9 |
| Variance | 8006.27585 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 83 | 8144 | 2.0% |
| 6 | 4712 | 1.2% |
| 5 | 3547 | 0.9% |
| 18 | 3523 | 0.9% |
| 14 | 3493 | 0.9% |
| 16 | 3405 | 0.8% |
| 17 | 3383 | 0.8% |
| 13 | 3349 | 0.8% |
| 20 | 3336 | 0.8% |
| 24 | 3240 | 0.8% |
| Other values (1038) | 363644 |
| Value | Count | Frequency (%) |
| 2 | 103 | < 0.1% |
| 3 | 719 | 0.2% |
| 4 | 264 | 0.1% |
| 5 | 3547 | |
| 5.4 | 2 | < 0.1% |
| 5.6 | 1 | < 0.1% |
| 6 | 4712 | |
| 6.4 | 1 | < 0.1% |
| 6.6 | 1 | < 0.1% |
| 7 | 2245 |
| Value | Count | Frequency (%) |
| 999 | 3 | |
| 995 | 1 | < 0.1% |
| 993 | 1 | < 0.1% |
| 992 | 1 | < 0.1% |
| 991 | 1 | < 0.1% |
| 988 | 1 | < 0.1% |
| 987 | 1 | < 0.1% |
| 986 | 1 | < 0.1% |
| 984 | 1 | < 0.1% |
| 983 | 1 | < 0.1% |
| Distinct | 685 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.54324847 |
| Minimum | 0.2856 |
|---|---|
| Maximum | 500 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | 0.2856 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 2.2848 |
| median | 7 |
| Q3 | 19 |
| 95-th percentile | 60 |
| Maximum | 500 |
| Range | 499.7144 |
| Interquartile range (IQR) | 16.7152 |
Descriptive statistics
| Standard deviation | 21.53958095 |
|---|---|
| Coefficient of variation (CV) | 1.385783737 |
| Kurtosis | 14.36912969 |
| Mean | 15.54324847 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 3.050025452 |
| Sum | 6275990.695 |
| Variance | 463.9535476 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 97027 | |
| 3 | 31771 | 7.9% |
| 7 | 22415 | 5.6% |
| 4 | 20810 | 5.2% |
| 5 | 17091 | 4.2% |
| 6 | 15762 | 3.9% |
| 8 | 12722 | 3.2% |
| 9 | 10952 | 2.7% |
| 10 | 10096 | 2.5% |
| 11 | 8863 | 2.2% |
| Other values (675) | 156267 |
| Value | Count | Frequency (%) |
| 0.2856 | 89 | < 0.1% |
| 0.5712 | 70 | < 0.1% |
| 0.8568 | 72 | < 0.1% |
| 1 | 3221 | 0.8% |
| 1.1424 | 84 | < 0.1% |
| 1.428 | 94 | < 0.1% |
| 1.7136 | 83 | < 0.1% |
| 1.9992 | 110 | < 0.1% |
| 2 | 97027 | |
| 2.1 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 500 | 3 | |
| 411 | 1 | < 0.1% |
| 341 | 1 | < 0.1% |
| 315 | 1 | < 0.1% |
| 314 | 1 | < 0.1% |
| 310 | 1 | < 0.1% |
| 299 | 1 | < 0.1% |
| 282 | 1 | < 0.1% |
| 278 | 1 | < 0.1% |
| 277 | 1 | < 0.1% |
| Distinct | 1210 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 50.35278459 |
| Minimum | 1.0265 |
|---|---|
| Maximum | 290 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | 1.0265 |
|---|---|
| 5-th percentile | 8 |
| Q1 | 24 |
| median | 44 |
| Q3 | 70 |
| 95-th percentile | 116 |
| Maximum | 290 |
| Range | 288.9735 |
| Interquartile range (IQR) | 46 |
Descriptive statistics
| Standard deviation | 34.25747322 |
|---|---|
| Coefficient of variation (CV) | 0.6803491305 |
| Kurtosis | 1.338853417 |
| Mean | 50.35278459 |
| Median Absolute Deviation (MAD) | 22 |
| Skewness | 1.068509369 |
| Sum | 20331245.95 |
| Variance | 1173.574471 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 50.35278459 | 11859 | 2.9% |
| 16 | 5572 | 1.4% |
| 22 | 5556 | 1.4% |
| 20 | 5523 | 1.4% |
| 17 | 5467 | 1.4% |
| 18 | 5441 | 1.3% |
| 26 | 5420 | 1.3% |
| 21 | 5416 | 1.3% |
| 19 | 5368 | 1.3% |
| 14 | 5366 | 1.3% |
| Other values (1200) | 342788 |
| Value | Count | Frequency (%) |
| 1.0265 | 3 | < 0.1% |
| 1.2318 | 2 | < 0.1% |
| 1.4371 | 2 | < 0.1% |
| 1.6424 | 3 | < 0.1% |
| 1.8477 | 1 | < 0.1% |
| 2 | 4364 | |
| 2.053 | 1 | < 0.1% |
| 2.2583 | 3 | < 0.1% |
| 2.4636 | 1 | < 0.1% |
| 2.6689 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 290 | 1 | |
| 285 | 1 | |
| 280 | 1 | |
| 277 | 2 | |
| 273 | 1 | |
| 270 | 1 | |
| 269 | 1 | |
| 265 | 1 | |
| 264 | 1 | |
| 263 | 2 |
| Distinct | 132 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1199.044874 |
| Minimum | 100 |
|---|---|
| Maximum | 10000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | 100 |
|---|---|
| 5-th percentile | 200 |
| Q1 | 500 |
| median | 900 |
| Q3 | 1500 |
| 95-th percentile | 3399 |
| Maximum | 10000 |
| Range | 9900 |
| Interquartile range (IQR) | 1000 |
Descriptive statistics
| Standard deviation | 1097.868685 |
|---|---|
| Coefficient of variation (CV) | 0.9156193478 |
| Kurtosis | 10.15730245 |
| Mean | 1199.044874 |
| Median Absolute Deviation (MAD) | 400 |
| Skewness | 2.653987532 |
| Sum | 484145543 |
| Variance | 1205315.65 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 900 | 40916 | 10.1% |
| 300 | 30662 | 7.6% |
| 400 | 29849 | 7.4% |
| 500 | 28043 | 6.9% |
| 600 | 27189 | 6.7% |
| 700 | 25720 | 6.4% |
| 800 | 22728 | 5.6% |
| 1000 | 19026 | 4.7% |
| 200 | 17370 | 4.3% |
| 1100 | 17009 | 4.2% |
| Other values (122) | 145264 |
| Value | Count | Frequency (%) |
| 100 | 5091 | 1.3% |
| 150 | 1 | < 0.1% |
| 200 | 17370 | |
| 300 | 30662 | |
| 350 | 1 | < 0.1% |
| 400 | 29849 | |
| 500 | 28043 | |
| 600 | 27189 | |
| 700 | 25720 | |
| 800 | 22728 |
| Value | Count | Frequency (%) |
| 10000 | 51 | |
| 9900 | 25 | |
| 9800 | 24 | |
| 9700 | 23 | |
| 9600 | 23 | |
| 9500 | 22 | |
| 9400 | 25 | |
| 9300 | 31 | |
| 9200 | 31 | |
| 9100 | 31 |
| Distinct | 1597 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 57.69670856 |
| Minimum | 0.2142 |
|---|---|
| Maximum | 1071 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | 0.2142 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 12 |
| median | 45 |
| Q3 | 82 |
| 95-th percentile | 178 |
| Maximum | 1071 |
| Range | 1070.7858 |
| Interquartile range (IQR) | 70 |
Descriptive statistics
| Standard deviation | 56.49177386 |
|---|---|
| Coefficient of variation (CV) | 0.9791160582 |
| Kurtosis | 6.394631357 |
| Mean | 57.69670856 |
| Median Absolute Deviation (MAD) | 34.29 |
| Skewness | 1.680004333 |
| Sum | 23296546.2 |
| Variance | 3191.320514 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 40544 | 10.0% |
| 45 | 15583 | 3.9% |
| 3 | 8245 | 2.0% |
| 4 | 7636 | 1.9% |
| 1 | 6878 | 1.7% |
| 5 | 6129 | 1.5% |
| 6 | 5641 | 1.4% |
| 8 | 4796 | 1.2% |
| 7 | 4642 | 1.1% |
| 10 | 3940 | 1.0% |
| Other values (1587) | 299742 |
| Value | Count | Frequency (%) |
| 0.2142 | 134 | < 0.1% |
| 0.4284 | 119 | < 0.1% |
| 0.6426 | 118 | < 0.1% |
| 0.8568 | 120 | < 0.1% |
| 1 | 6878 | |
| 1.071 | 138 | < 0.1% |
| 1.2852 | 147 | < 0.1% |
| 1.4994 | 166 | < 0.1% |
| 1.7136 | 125 | < 0.1% |
| 1.9278 | 147 | < 0.1% |
| Value | Count | Frequency (%) |
| 1071 | 14 | |
| 1050 | 1 | < 0.1% |
| 1026 | 1 | < 0.1% |
| 674 | 1 | < 0.1% |
| 673 | 1 | < 0.1% |
| 500 | 5 | < 0.1% |
| 450 | 1 | < 0.1% |
| 444 | 1 | < 0.1% |
| 432 | 1 | < 0.1% |
| 429 | 1 | < 0.1% |
| Distinct | 1188 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14.08889947 |
| Minimum | -19.9 |
|---|---|
| Maximum | 41.6 |
| Zeros | 2642 |
| Zeros (%) | 0.7% |
| Negative | 55474 |
| Negative (%) | 13.7% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | -19.9 |
|---|---|
| 5-th percentile | -4 |
| Q1 | 4 |
| median | 15.4 |
| Q3 | 23.5 |
| 95-th percentile | 30.7 |
| Maximum | 41.6 |
| Range | 61.5 |
| Interquartile range (IQR) | 19.5 |
Descriptive statistics
| Standard deviation | 11.29983762 |
|---|---|
| Coefficient of variation (CV) | 0.8020383452 |
| Kurtosis | -1.086168918 |
| Mean | 14.08889947 |
| Median Absolute Deviation (MAD) | 9.4 |
| Skewness | -0.1687530123 |
| Sum | 5688759.474 |
| Variance | 127.6863303 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 3342 | 0.8% |
| 1 | 2796 | 0.7% |
| 0 | 2642 | 0.7% |
| 2 | 2556 | 0.6% |
| -1 | 2436 | 0.6% |
| -2 | 2293 | 0.6% |
| -4 | 1844 | 0.5% |
| 4 | 1772 | 0.4% |
| 5 | 1680 | 0.4% |
| -5 | 1633 | 0.4% |
| Other values (1178) | 380782 |
| Value | Count | Frequency (%) |
| -19.9 | 1 | |
| -19.7 | 1 | |
| -19.5 | 1 | |
| -18.9 | 1 | |
| -18.7 | 1 | |
| -18.5 | 1 | |
| -18.1 | 1 | |
| -17.9 | 1 | |
| -17.4 | 1 | |
| -17.3 | 1 |
| Value | Count | Frequency (%) |
| 41.6 | 1 | < 0.1% |
| 41.4 | 2 | < 0.1% |
| 41.1 | 3 | < 0.1% |
| 41 | 2 | < 0.1% |
| 40.9 | 1 | < 0.1% |
| 40.6 | 2 | < 0.1% |
| 40.5 | 8 | |
| 40.4 | 3 | < 0.1% |
| 40.3 | 4 | |
| 40.2 | 2 | < 0.1% |
| Distinct | 677 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1010.282534 |
| Minimum | 982.4 |
|---|---|
| Maximum | 1042.8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | 982.4 |
|---|---|
| 5-th percentile | 994.6 |
| Q1 | 1002 |
| median | 1009.8 |
| Q3 | 1018.3 |
| 95-th percentile | 1027.4 |
| Maximum | 1042.8 |
| Range | 60.4 |
| Interquartile range (IQR) | 16.3 |
Descriptive statistics
| Standard deviation | 10.35337882 |
|---|---|
| Coefficient of variation (CV) | 0.01024800338 |
| Kurtosis | -0.7814634981 |
| Mean | 1010.282534 |
| Median Absolute Deviation (MAD) | 8.2 |
| Skewness | 0.151997731 |
| Sum | 407927840.5 |
| Variance | 107.192453 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1019 | 2712 | 0.7% |
| 1018 | 2695 | 0.7% |
| 1021 | 2691 | 0.7% |
| 1015 | 2602 | 0.6% |
| 1023 | 2596 | 0.6% |
| 1020 | 2570 | 0.6% |
| 1017 | 2554 | 0.6% |
| 1016 | 2528 | 0.6% |
| 1022 | 2474 | 0.6% |
| 1024 | 2455 | 0.6% |
| Other values (667) | 377899 |
| Value | Count | Frequency (%) |
| 982.4 | 2 | < 0.1% |
| 982.7 | 2 | < 0.1% |
| 982.8 | 3 | |
| 982.9 | 2 | < 0.1% |
| 983 | 4 | |
| 983.2 | 4 | |
| 983.3 | 3 | |
| 983.4 | 2 | < 0.1% |
| 983.5 | 6 | |
| 983.6 | 4 |
| Value | Count | Frequency (%) |
| 1042.8 | 2 | < 0.1% |
| 1042.4 | 1 | < 0.1% |
| 1042.3 | 2 | < 0.1% |
| 1042.2 | 1 | < 0.1% |
| 1042 | 11 | |
| 1041.8 | 8 | |
| 1041.7 | 1 | < 0.1% |
| 1041.6 | 7 | |
| 1041.5 | 2 | < 0.1% |
| 1041.4 | 8 |
| Distinct | 646 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.157291447 |
| Minimum | -43.4 |
|---|---|
| Maximum | 29.1 |
| Zeros | 828 |
| Zeros (%) | 0.2% |
| Negative | 168595 |
| Negative (%) | 41.8% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | -43.4 |
|---|---|
| 5-th percentile | -19.4 |
| Q1 | -8 |
| median | 4.1 |
| Q3 | 15.5 |
| 95-th percentile | 22.2 |
| Maximum | 29.1 |
| Range | 72.5 |
| Interquartile range (IQR) | 23.5 |
Descriptive statistics
| Standard deviation | 13.61273596 |
|---|---|
| Coefficient of variation (CV) | 4.311523402 |
| Kurtosis | -1.076908329 |
| Mean | 3.157291447 |
| Median Absolute Deviation (MAD) | 11.7 |
| Skewness | -0.2501055805 |
| Sum | 1274838.511 |
| Variance | 185.3065803 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 17.6 | 1559 | 0.4% |
| 17 | 1519 | 0.4% |
| 17.2 | 1490 | 0.4% |
| 16.8 | 1483 | 0.4% |
| 17.3 | 1455 | 0.4% |
| 17.1 | 1445 | 0.4% |
| 17.8 | 1440 | 0.4% |
| 16.2 | 1429 | 0.4% |
| 18.2 | 1426 | 0.4% |
| 17.5 | 1409 | 0.3% |
| Other values (636) | 389121 |
| Value | Count | Frequency (%) |
| -43.4 | 1 | < 0.1% |
| -36 | 1 | < 0.1% |
| -35.7 | 1 | < 0.1% |
| -35.5 | 1 | < 0.1% |
| -35.3 | 7 | |
| -35.1 | 9 | |
| -35 | 6 | |
| -34.9 | 2 | < 0.1% |
| -34.8 | 7 | |
| -34.6 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 29.1 | 2 | < 0.1% |
| 29 | 1 | < 0.1% |
| 28.8 | 10 | |
| 28.7 | 12 | |
| 28.6 | 2 | < 0.1% |
| 28.5 | 12 | |
| 28.4 | 14 | |
| 28.3 | 14 | |
| 28.2 | 9 | |
| 28.1 | 9 |
| Distinct | 254 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.06705178246 |
| Minimum | 0 |
|---|---|
| Maximum | 72.5 |
| Zeros | 387119 |
| Zeros (%) | 95.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 72.5 |
| Range | 72.5 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.8375740317 |
|---|---|
| Coefficient of variation (CV) | 12.49145065 |
| Kurtosis | 1292.745862 |
| Mean | 0.06705178246 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 29.4497644 |
| Sum | 27073.90052 |
| Variance | 0.7015302586 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 387119 | |
| 0.1 | 3689 | 0.9% |
| 0.2 | 1823 | 0.5% |
| 0.3 | 1374 | 0.3% |
| 0.4 | 885 | 0.2% |
| 0.5 | 847 | 0.2% |
| 0.6 | 698 | 0.2% |
| 0.7 | 585 | 0.1% |
| 0.9 | 502 | 0.1% |
| 0.8 | 482 | 0.1% |
| Other values (244) | 5772 | 1.4% |
| Value | Count | Frequency (%) |
| 0 | 387119 | |
| 0.06705178246 | 261 | 0.1% |
| 0.1 | 3689 | 0.9% |
| 0.2 | 1823 | 0.5% |
| 0.3 | 1374 | 0.3% |
| 0.4 | 885 | 0.2% |
| 0.5 | 847 | 0.2% |
| 0.6 | 698 | 0.2% |
| 0.7 | 585 | 0.1% |
| 0.8 | 482 | 0.1% |
| Value | Count | Frequency (%) |
| 72.5 | 3 | |
| 52.1 | 2 | < 0.1% |
| 47.7 | 1 | < 0.1% |
| 46.4 | 6 | |
| 45.9 | 2 | < 0.1% |
| 41.9 | 1 | < 0.1% |
| 40.7 | 3 | |
| 39 | 1 | < 0.1% |
| 38.9 | 1 | < 0.1% |
| 37.4 | 2 | < 0.1% |
wd
Categorical
| Distinct | 16 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.1 MiB |
| NE | |
|---|---|
| ENE | |
| N | |
| NW | |
| E | |
| Other values (11) |
Length
| Max length | 3 |
|---|---|
| Median length | 2 |
| Mean length | 2.237356851 |
| Min length | 1 |
Characters and Unicode
| Total characters | 903391 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NNW |
|---|---|
| 2nd row | N |
| 3rd row | NNW |
| 4th row | NW |
| 5th row | N |
Common Values
| Value | Count | Frequency (%) |
| NE | 41438 | 10.3% |
| ENE | 33262 | 8.2% |
| N | 29973 | 7.4% |
| NW | 29587 | 7.3% |
| E | 29168 | 7.2% |
| NNE | 27247 | 6.7% |
| SW | 27083 | 6.7% |
| NNW | 24167 | 6.0% |
| WNW | 23815 | 5.9% |
| ESE | 23691 | 5.9% |
| Other values (6) | 114345 |
Length
| Value | Count | Frequency (%) |
| ne | 41438 | 10.3% |
| ene | 33262 | 8.2% |
| n | 29973 | 7.4% |
| nw | 29587 | 7.3% |
| e | 29168 | 7.2% |
| nne | 27247 | 6.7% |
| sw | 27083 | 6.7% |
| nnw | 24167 | 6.0% |
| wnw | 23815 | 5.9% |
| ese | 23691 | 5.9% |
| Other values (6) | 114345 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 260903 | |
| E | 248114 | |
| W | 207045 | |
| S | 187329 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 903391 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 260903 | |
| E | 248114 | |
| W | 207045 | |
| S | 187329 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 903391 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 260903 | |
| E | 248114 | |
| W | 207045 | |
| S | 187329 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 903391 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 260903 | |
| E | 248114 | |
| W | 207045 | |
| S | 187329 |
| Distinct | 115 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.718192017 |
| Minimum | 0 |
|---|---|
| Maximum | 13.2 |
| Zeros | 10891 |
| Zeros (%) | 2.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 3.1 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.3 |
| Q1 | 0.9 |
| median | 1.4 |
| Q3 | 2.2 |
| 95-th percentile | 4.2 |
| Maximum | 13.2 |
| Range | 13.2 |
| Interquartile range (IQR) | 1.3 |
Descriptive statistics
| Standard deviation | 1.237624097 |
|---|---|
| Coefficient of variation (CV) | 0.7203060454 |
| Kurtosis | 3.695959966 |
| Mean | 1.718192017 |
| Median Absolute Deviation (MAD) | 0.6 |
| Skewness | 1.626099043 |
| Sum | 693764.7 |
| Variance | 1.531713406 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.1 | 21486 | 5.3% |
| 1 | 21370 | 5.3% |
| 1.2 | 21228 | 5.3% |
| 0.9 | 20237 | 5.0% |
| 1.3 | 19640 | 4.9% |
| 0.8 | 18585 | 4.6% |
| 1.4 | 18014 | 4.5% |
| 0.7 | 16969 | 4.2% |
| 1.5 | 16273 | 4.0% |
| 1.6 | 15098 | 3.7% |
| Other values (105) | 214876 |
| Value | Count | Frequency (%) |
| 0 | 10891 | |
| 0.1 | 4175 | 1.0% |
| 0.2 | 4378 | 1.1% |
| 0.3 | 2673 | 0.7% |
| 0.4 | 7154 | 1.8% |
| 0.5 | 10842 | |
| 0.6 | 13881 | |
| 0.7 | 16969 | |
| 0.8 | 18585 | |
| 0.9 | 20237 |
| Value | Count | Frequency (%) |
| 13.2 | 1 | < 0.1% |
| 12.9 | 1 | < 0.1% |
| 12.8 | 1 | < 0.1% |
| 11.8 | 1 | < 0.1% |
| 11.7 | 1 | < 0.1% |
| 11.2 | 3 | |
| 11 | 1 | < 0.1% |
| 10.9 | 3 | |
| 10.7 | 1 | < 0.1% |
| 10.5 | 3 |
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.1 MiB |
| Dingling | |
|---|---|
| Wanshouxigong | |
| Guanyuan | |
| Huairou | |
| Wanliu | |
| Other values (7) |
Length
| Max length | 13 |
|---|---|
| Median length | 7.5 |
| Mean length | 8.416666667 |
| Min length | 6 |
Characters and Unicode
| Total characters | 3398448 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Aotizhongxin |
|---|---|
| 2nd row | Aotizhongxin |
| 3rd row | Aotizhongxin |
| 4th row | Aotizhongxin |
| 5th row | Aotizhongxin |
Common Values
| Value | Count | Frequency (%) |
| Dingling | 33648 | |
| Wanshouxigong | 33648 | |
| Guanyuan | 33648 | |
| Huairou | 33648 | |
| Wanliu | 33648 | |
| Aotizhongxin | 33648 | |
| Dongsi | 33648 | |
| Nongzhanguan | 33648 | |
| Shunyi | 33648 | |
| Tiantan | 33648 | |
| Other values (2) | 67296 |
Length
| Value | Count | Frequency (%) |
| changping | 33648 | |
| dongsi | 33648 | |
| shunyi | 33648 | |
| gucheng | 33648 | |
| nongzhanguan | 33648 | |
| guanyuan | 33648 | |
| huairou | 33648 | |
| dingling | 33648 | |
| wanliu | 33648 | |
| aotizhongxin | 33648 | |
| Other values (2) | 67296 |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 639312 | |
| i | 370128 | |
| g | 370128 | |
| a | 336480 | |
| u | 302832 | |
| o | 235536 | 6.9% |
| h | 201888 | 5.9% |
| t | 67296 | 2.0% |
| z | 67296 | 2.0% |
| x | 67296 | 2.0% |
| Other values (16) | 740256 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2994672 | |
| Uppercase Letter | 403776 | 11.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 639312 | |
| i | 370128 | |
| g | 370128 | |
| a | 336480 | |
| u | 302832 | |
| o | 235536 | 7.9% |
| h | 201888 | 6.7% |
| t | 67296 | 2.2% |
| z | 67296 | 2.2% |
| x | 67296 | 2.2% |
| Other values (7) | 336480 |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 67296 | |
| G | 67296 | |
| W | 67296 | |
| A | 33648 | |
| C | 33648 | |
| H | 33648 | |
| N | 33648 | |
| S | 33648 | |
| T | 33648 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3398448 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 639312 | |
| i | 370128 | |
| g | 370128 | |
| a | 336480 | |
| u | 302832 | |
| o | 235536 | 6.9% |
| h | 201888 | 5.9% |
| t | 67296 | 2.0% |
| z | 67296 | 2.0% |
| x | 67296 | 2.0% |
| Other values (16) | 740256 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3398448 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 639312 | |
| i | 370128 | |
| g | 370128 | |
| a | 336480 | |
| u | 302832 | |
| o | 235536 | 6.9% |
| h | 201888 | 5.9% |
| t | 67296 | 2.0% |
| z | 67296 | 2.0% |
| x | 67296 | 2.0% |
| Other values (16) | 740256 |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| REF_NO | year | month | day | hour | PM2.5 | PM10 | SO2 | NO2 | CO | O3 | TEMP | PRES | DEWP | RAIN | wd | WSPM | station | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 2013 | 3 | 1 | 0 | 4.0 | 4.0 | 4.0 | 7.0 | 300.0 | 77.0 | -0.7 | 1023.0 | -18.8 | 0.0 | NNW | 4.4 | Aotizhongxin |
| 1 | 2 | 2013 | 3 | 1 | 1 | 8.0 | 8.0 | 4.0 | 7.0 | 300.0 | 77.0 | -1.1 | 1023.2 | -18.2 | 0.0 | N | 4.7 | Aotizhongxin |
| 2 | 3 | 2013 | 3 | 1 | 2 | 7.0 | 7.0 | 5.0 | 10.0 | 300.0 | 73.0 | -1.1 | 1023.5 | -18.2 | 0.0 | NNW | 5.6 | Aotizhongxin |
| 3 | 4 | 2013 | 3 | 1 | 3 | 6.0 | 6.0 | 11.0 | 11.0 | 300.0 | 72.0 | -1.4 | 1024.5 | -19.4 | 0.0 | NW | 3.1 | Aotizhongxin |
| 4 | 5 | 2013 | 3 | 1 | 4 | 3.0 | 3.0 | 12.0 | 12.0 | 300.0 | 72.0 | -2.0 | 1025.2 | -19.5 | 0.0 | N | 2.0 | Aotizhongxin |
| 5 | 6 | 2013 | 3 | 1 | 5 | 5.0 | 5.0 | 18.0 | 18.0 | 400.0 | 66.0 | -2.2 | 1025.6 | -19.6 | 0.0 | N | 3.7 | Aotizhongxin |
| 6 | 7 | 2013 | 3 | 1 | 6 | 3.0 | 3.0 | 18.0 | 32.0 | 500.0 | 50.0 | -2.6 | 1026.5 | -19.1 | 0.0 | NNE | 2.5 | Aotizhongxin |
| 7 | 8 | 2013 | 3 | 1 | 7 | 3.0 | 6.0 | 19.0 | 41.0 | 500.0 | 43.0 | -1.6 | 1027.4 | -19.1 | 0.0 | NNW | 3.8 | Aotizhongxin |
| 8 | 9 | 2013 | 3 | 1 | 8 | 3.0 | 6.0 | 16.0 | 43.0 | 500.0 | 45.0 | 0.1 | 1028.3 | -19.2 | 0.0 | NNW | 4.1 | Aotizhongxin |
| 9 | 10 | 2013 | 3 | 1 | 9 | 3.0 | 8.0 | 12.0 | 28.0 | 400.0 | 59.0 | 1.2 | 1028.5 | -19.3 | 0.0 | N | 2.6 | Aotizhongxin |
Last rows
| REF_NO | year | month | day | hour | PM2.5 | PM10 | SO2 | NO2 | CO | O3 | TEMP | PRES | DEWP | RAIN | wd | WSPM | station | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 403766 | 33639 | 2016 | 12 | 31 | 14 | 399.0 | 412.0 | 31.0 | 198.0 | 4900.0 | 6.0 | 3.8 | 1021.9 | -8.9 | 0.0 | SSE | 1.0 | Wanshouxigong |
| 403767 | 33640 | 2016 | 12 | 31 | 15 | 449.0 | 524.0 | 30.0 | 217.0 | 5600.0 | 8.0 | 3.9 | 1021.5 | -6.1 | 0.0 | S | 1.4 | Wanshouxigong |
| 403768 | 33641 | 2016 | 12 | 31 | 16 | 440.0 | 440.0 | 26.0 | 200.0 | 4700.0 | 6.0 | 2.8 | 1021.5 | -6.6 | 0.0 | SSE | 0.7 | Wanshouxigong |
| 403769 | 33642 | 2016 | 12 | 31 | 17 | 378.0 | 378.0 | 20.0 | 171.0 | 3800.0 | 4.0 | 1.2 | 1021.4 | -5.5 | 0.0 | SSE | 1.1 | Wanshouxigong |
| 403770 | 33643 | 2016 | 12 | 31 | 18 | 392.0 | 458.0 | 14.0 | 160.0 | 3900.0 | 3.0 | -1.3 | 1021.9 | -6.5 | 0.0 | S | 0.6 | Wanshouxigong |
| 403771 | 33644 | 2016 | 12 | 31 | 19 | 449.0 | 487.0 | 10.0 | 153.0 | 4500.0 | 4.0 | -1.9 | 1022.0 | -6.1 | 0.0 | ESE | 0.9 | Wanshouxigong |
| 403772 | 33645 | 2016 | 12 | 31 | 20 | 460.0 | 492.0 | 12.0 | 146.0 | 4100.0 | 4.0 | -2.5 | 1022.4 | -5.5 | 0.0 | ENE | 0.7 | Wanshouxigong |
| 403773 | 33646 | 2016 | 12 | 31 | 21 | 463.0 | 498.0 | 12.0 | 141.0 | 4400.0 | 5.0 | -3.0 | 1022.1 | -5.3 | 0.0 | E | 0.9 | Wanshouxigong |
| 403774 | 33647 | 2016 | 12 | 31 | 22 | 493.0 | 537.0 | 12.0 | 124.0 | 5000.0 | 8.0 | -3.0 | 1022.7 | -5.0 | 0.0 | SW | 0.1 | Wanshouxigong |
| 403775 | 33648 | 2016 | 12 | 31 | 23 | 464.0 | 490.0 | 8.0 | 111.0 | 5400.0 | 7.0 | -4.0 | 1022.6 | -5.7 | 0.0 | ENE | 0.9 | Wanshouxigong |